Swivel: Improving Embeddings by Noticing What's Missing

نویسندگان

  • Noam Shazeer
  • Ryan Doherty
  • Colin Evans
  • Chris Waterson
چکیده

We present Submatrix-wise Vector Embedding Learner (Swivel), a method for generating lowdimensional feature embeddings from a feature co-occurrence matrix. Swivel performs approximate factorization of the point-wise mutual information matrix via stochastic gradient descent. It uses a piecewise loss with special handling for unobserved co-occurrences, and thus makes use of all the information in the matrix. While this requires computation proportional to the size of the entire matrix, we make use of vectorized multiplication to process thousands of rows and columns at once to compute millions of predicted values. Furthermore, we partition the matrix into shards in order to parallelize the computation across many nodes. This approach results in more accurate embeddings than can be achieved with methods that consider only observed cooccurrences, and can scale to much larger corpora than can be handled with sampling methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

What's in an Embedding? Analyzing Word Embeddings through Multilingual Evaluation

In the last two years, there has been a surge of word embedding algorithms and research on them. However, evaluation has mostly been carried out on a narrow set of tasks, mainly word similarity/relatedness and word relation similarity and on a single language, namely English. We propose an approach to evaluate embeddings on a variety of languages that also yields insights into the structure of ...

متن کامل

Automated Detection of Non-Relevant Posts on the Russian Imageboard "2ch": Importance of the Choice of Word Representations

This study considers the problem of automated detection of non-relevant posts on Web forums and discusses the approach of resolving this problem by approximation it with the task of detection of semantic relatedness between the given post and the opening post of the forum discussion thread. The approximated task could be resolved through learning the supervised classifier with a composed word e...

متن کامل

A small swivel joint for infusion of free moving animals.

construction and application of a small, light, inexpensive swivel joint suitable for infusions of small laboratory animals is described. Commercially available tubes and needles are used in the construction of the swivel thus making it easy to prepare and essentially disposable. This swivel is especially useful in long-and short-term infusions of pharmaca and nutrients in the general circulati...

متن کامل

DNA swivel enzyme activity in a nuclear membrane fraction

DNA swivel (nicking-rejoining) enzyme activity has been studied in various cell fractions of a human lymphoid cell line. Swivel activity is found only in chromatin and in a nuclear membrane fraction containing DNA and possessing endogenous DNA synthesizing activity. Twenty percent of the total swivel activity and less than one percent of the total DNA are in the membrane fraction. The swivel en...

متن کامل

Ensemble cryo-EM uncovers inchworm-like translocation of a viral IRES through the ribosome

Internal ribosome entry sites (IRESs) mediate cap-independent translation of viral mRNAs. Using electron cryo-microscopy of a single specimen, we present five ribosome structures formed with the Taura syndrome virus IRES and translocase eEF2•GTP bound with sordarin. The structures suggest a trajectory of IRES translocation, required for translation initiation, and provide an unprecedented view ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1602.02215  شماره 

صفحات  -

تاریخ انتشار 2016